A Statistical Text Mining Method for Patent Analysis
Abstract
Most text data in document databases are unstructured and therefore unsuitable for analytical methods based on statistics and machine learning. Patent documents are likewise stored as text, so, like other document collections, they must be transformed into structured data before statistical analysis. This transformation is performed by text mining preprocessing, after which the patent documents can be analyzed. A patent analysis thus normally requires two separate phases: preprocessing and analysis. In this paper, we combine the two phases into one. We propose a statistical text mining method that carries out text mining and statistical analysis at the same time, with the aim of improving the performance of patent data analysis. To show the contribution of our study, we illustrate how the method can be applied in a real domain using a target technology.
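The transformation of patent text into structured data mentioned in the abstract is commonly realized as a document-term matrix. The following is a minimal sketch of that conventional preprocessing step only, not the method proposed in the paper; the stop-word list, the function names `tokenize` and `document_term_matrix`, and the two toy documents are assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list, chosen only for this illustration.
STOP_WORDS = {"a", "an", "and", "for", "in", "of", "the", "to", "with"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def document_term_matrix(docs):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in document i."""
    counts = [Counter(tokenize(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocab] for c in counts]
    return vocab, matrix


# Toy documents standing in for patent titles (not real patent data).
docs = [
    "A method for wireless charging of a battery",
    "Battery management system with wireless sensor",
]
vocab, dtm = document_term_matrix(docs)
```

Each row of `dtm` is a numeric vector for one document, which is the kind of structured input that statistical methods can work with.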
Similar resources
Emerging Technology Forecasting Using New Patent Information Analysis
Emerging technology drives technological development and innovation in diverse fields of technology. Emerging technology forecasting can predict the possible areas of emerging technology. However, it is difficult to forecast the emerging technology because most technology forecasting tasks depend on the subjective experience of experts. Patent analysis is an objective method to recognize the tr...
Modeling Patent Quality: A System for Large-scale Patentability Analysis using Text Mining
Current patent systems face a serious problem of declining patent quality, as the growing number of applications makes it difficult for patent officers to spend enough time evaluating each application. For building a better patent system, it is necessary to define a public consensus on the quality of patent applications in a quantitative way. In this article, we tackle the problem of asses...
Psalm – Patent Mining Tool for Competitive Intelligence
A patent document is a valuable source of information. However, it is neither easy to extract useful information from patents nor simple to track evidence about all patents that may be relevant. This paper describes PSALM (Patent Search and Analysis for Landscaping and Management), a recently developed software tool for competitive intelligence based on patent data. PSAL...
Mining Process, Techniques and Tools: an Overview
Text Mining has become an important research area, which refers to the application of machine learning (or data mining) techniques in the study of Information Retrieval and Natural Language Processing. In a sense, it is defined as the way of discovering knowledge from ubiquitous text data that are easily accessible over the Internet or an intranet. The survey of Text Mining techniques, Text Min...
Text mining techniques for patent analysis
Patent documents contain important research results. However, they are lengthy and rich in technical terminology, so analyzing them takes considerable human effort. Automatic tools for assisting patent engineers or decision makers in patent analysis are in great demand. This paper describes a series of text mining techniques that conforms to the analytical process used by patent analysts. T...